Improving prosodic phrase prediction by unsupervised adaptation and syntactic features extraction

نویسندگان

  • Zhigang Chen
  • Guoping Hu
  • Wei Jiang
چکیده

In the state-of-the-art speech synthesis system, prosodic phrase prediction is the most serious problem which leads to about 40% of text analysis errors. Two optimization strategies are proposed in this paper to deal with two major types of prosodic phrase prediction errors. First, unsupervised adaptation method is proposed to alleviate the mismatching problem between training and testing. Second, syntactic features are extracted from parser and integrated into prediction model to ensure that the consistency between the predicted prosodic structure and the syntactic structure. We examine our methods on an in-house Mandarin speech synthesis system and experiment results show that both strategies yield positive effects and the sentence unacceptable rate significantly drops from 15.9% to 8.75%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Mandarin prosodic boundary prediction with rich syntactic features

Previous researches indicated that the performance of automatic prosodic boundary labeling benefited from syntactic phrase information for Mandarin. However, the influence of other syntactic features such as dependency has not been studied in-depth yet, especially on large scale corpus. This paper demonstrates the usefulness of rich syntactic features for Mandarin phrase boundary prediction. Bo...

متن کامل

End-of-Utterance Prediction by Prosodic Features and Phrase-Dependency Structure in Spontaneous Japanese Speech

This study is aimed at uncovering a way that participants in conversation predict end-of-utterance for spontaneous Japanese speech. In spontaneous everyday conversation, the participants must predict the ends of utterances of a speaker to perform smooth turn-taking without too much gap. We consider that they utilize not only syntactic factors but also prosodic factors for the end-of-utterance p...

متن کامل

Unsupervised Extraction of Prosodic Structure

Our approach for unsupervised extraction of prosodic structure in spontaneous speech consists of the four steps: chunking into interpausal units, syllable nucleus extraction, prosodic boundary detection, and pitch accent detection. The extraction is based on acoustic features derived from F0 parameterization, and on energy and segment duration features. Phrase boundaries and accents are detecte...

متن کامل

Phonetic pitch movements of accentual phrases in Korean read speech

The minor prosodic unit in Korean language, generally called an Accentual Phrase, is usually defined by its syntactic or phonological characteristics. This article looks at the correlation between phonetic pitch movements and accentual phrase boundaries using a technique of pattern extraction and prediction by a probabilistic grammar.

متن کامل

Use of Prosodic Features in Speech Recognition

Two methods were proposed for the use of prosodic features in speech recognition: one to detect major syntactic (phrase) boundaries as the initial phase of speech recognition, and the other to check the feasibility of the results of ordinary recognition process from the viewpoint of prosodic features. In the rst method, fundamental frequency contours were assumed as waveforms as functions of ti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010